# RoBERTa Variants
## efficient_mlm_m0.15-801010
A RoBERTa-based masked language model with pre-layer normalization, trained with a 15% masking ratio and the 80/10/10 token-corruption scheme to study how the masking ratio affects masked language modeling.
- Tags: Large Language Model, Transformers
- Organization: princeton-nlp

## efficient_mlm_m0.40
A RoBERTa-based masked language model with pre-layer normalization, trained with a 40% masking ratio to study how the masking ratio affects model performance.
- Tags: Large Language Model, Transformers
- Organization: princeton-nlp
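Both entries are masked language models, so a standard fill-mask workflow applies. Below is a minimal sketch using the Hugging Face transformers library; the checkpoint id is inferred from the listing above, and its compatibility with the generic `AutoModelForMaskedLM` loader is an assumption, since the pre-layer-norm RoBERTa architecture may require a dedicated model class or the authors' own training fork.

```python
# Minimal fill-mask sketch (assumptions: checkpoint id and that a recent
# transformers release can load this pre-layer-norm RoBERTa checkpoint
# through the generic AutoModelForMaskedLM path).
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_id = "princeton-nlp/efficient_mlm_m0.40"  # assumed checkpoint id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)
model.eval()

# Build an input containing a single mask token.
text = f"The capital of France is {tokenizer.mask_token}."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Locate the masked position and print the top-5 predicted tokens.
mask_index = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
top_ids = logits[0, mask_index].topk(5, dim=-1).indices[0]
print(tokenizer.convert_ids_to_tokens(top_ids))
```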